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CHAPTER I 
iN PRODUCT TON 


Quality surveillance of military petroleum products 
is the aggregate of measures to be applied to determine and 
Meadmeadin” Cheir Quality. “OuelLity surveillance programs are 
conducted in order that required petroleum products will be 
available in a condition suitable for immediate use. Their 
ultimate purpose is: (1) to insure that no life is ever 
lost or equipment damaged or destroyed through the use of 
contaminated or deteriorated petroleum products, and (2) to 
promote economy by minimizing the necessity of surveying or 
reclaiming any petroleum products because of contamination 
On Octerloracion. 

The success of any such program is dependent upon 
several factors, not the least of which is the maintenance 
of the highest standards of reliability in the testing lab- 


oratories. 


PpOweatece (Cr MaborecOny Kedlataoil ey 


A chemical analysis has been compared to an elastic 
VemeSele cs never giving they sate secu twice,* How is one 
=OxKnNOw then if slaboracory tests are “right”? “Ene. fact is 
that any decision regarding a petroleum product based on 


laboratory test results is a decision under uncertainty. 
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Decisions under uncertainty always involve a risk of making 
the wrong decision. This is of particular concern when test 
mesules OL, a- Petroleum Produce, border Om accepeabi lity 
himiesss in Order “co properly evaluate the risk of misclassi- 
mic DOLecriLine maceiial” At Soimporcane: Ge Know how much 
stretch or shrinkage to allow in reported test results. 
Common sense dictates that it is also important to reduce the 
mic Oy —6eonci ng, ene SlAaSstIGicy @& Ene vardstick as muchas 
mene Conomicaliy teasible.. fine, eceonomiecsvor =reliabiticy 
control are probably most apparent when considering a com- 
Reine Lal IapOPicae On kor which ene Coses Of reliability conero. 
ana the costs of wrong decisions can be quite accurately com- 
puted. Consider a refinery laboratory where small devia- 
melons coule be expensive ones. When mixing a blend, 4a small 
excess per sample unit of an expensive component could add 
BO tO Many dollars in excess Costs 1m a GCOnelnucUusS process. 
J. T. Walter cites a report by one refinery of losses of one 
mitiion dollars per vear ona ‘single operation due to eile bart y 
give-away.“ Conversely, asdeticiency. could cavuce se icceilon 
of a product by a customer and add the costs of reprocessing 
wor ee (OrOoGcuct. | 

in. vegara co miditary dpplicacions.. consider ehei cost 
of delay in GiLseharging a renker Ys cargo of defveling “acshie 
while additional samples are tested, if the first sample 


results indicated that the quality was suspect. As a more 





sobering example, we might visualize a heavily loaded air- 
craft faltering on take-off and crashing because of loss of 
power due to vapor lock. This could result from mis-classi- 
fication of unfit fuel based on unreliable test results. 
From these examples, the importance of the reliability 
®e Jaboratory test results in any attempt to control quality 


should be evident. 


Pecan nr ott rOre EO Control Military Laboratory Reliability 


At the command level, the maintenance of the highest 
standards of reliability in testing laboratories is depend- 
ent upon the ability to detect apparent trends toward unre- 
PraoiLicys Un pursuit “Of s:haS coalsce correlation testing 
Beeocgtamines been Seu, UD wWlehinia Majer militery areca Command 
acea Pare Or ALS G@uality Surveillance program, “Identical 
Samples OF AViet1On Gasoline, MOoeer gasoline, jet fuel, 
diesel fuel and lubricating oil are prepared and distributed 
tir—annualivy to each Of tem participating -laboracories.* “The 
results are summarized and the average value of all observa- 
PUlOnG. tS GSeeCrinined TOM Cache test, Repredcuecaniiliey aimics 
are then computed for those tests for which a method of 
determining reproducability limits is given in the applicable 
American Society for Testing and Materials (ASTM) Standard. 
Re Proecucabiiity dames can be computed for about seventy 
Five per cent of the tests. The test results falling out- 


side of these limits are indicated by an asterisk. A Summary 





of Laboratory Performance 1S prepared which tabulates by 

Gti ie, ~ lee MUmber rE ECSUSe reported Lor wale re produeEe— 
ote em eS IA brew cOMpuLeo oneGerlie) Per Gent iwitonin ceprocuc— 
Hoi nev elimite.- Maen summanyeineludes the, tabular Gata for 
each of the two preceding series of tests as well as for the 


current series. 


Eueose Of the Thesis 

The purpose of this thesis is to investigate some 
Seacrst ical methods Of Ereating hei Gata obeaimea echrougn 
the military area command correlation testing program 
described above to extract more definitive information from 
them concerning the reliability of the participating labora- 


tories' test results. 
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RELIABILITY 


This Chapter discusses types Of measurement error and 
their effects, and defines the associated terminology as it 
will be used throughout the following chapters. 

Also defined are repeatability and reproducability as 


used by the American Society for Testing and Materials. 


GrUSES Of UNRELIABLE DEY 


Scarborough points out that all measurements are sub- 
Neer CO Ehree Kinds OL-error: “systematic or Constant Grrors, 


mistakes, and accidental errors. 


Syocemet ie GOrs 


Systematic or constant errors are those which affect 
all measurements alike. In regard to laboratory test results, 
they could for example, be due to improperly calibrated equip- 
ment or due to consistent but incorrect operative techniques. 


Systematic errors are usually evident as a constant bias. 


Mistakes 

Mistakes or blunders are due to carelessness primarily 
PMU Kang OF reCOrcing CaseryeaciOns-.. «Tne 2eact that they “do 
not follow any law makes gross blunders recognizable as 
isolated data points. Minor mistakes, however, may be dif- 


fret, FO -CeLCeCE. 





AccigGemtal Errors 

Accidental errors are those whose causes are unknown 
or undetermined. They are usually small and they are con- 
peered (6G EORIew ene viavc ersehiance . » Consequently elney sarc 
also referred to as chance errors or random errors. 

The mathematical theory of errors deals with acciden- 
malecrrers Only. “~[mateis Co Say, Systemeacic Errors and gross 
blunders are due to assignable causes and can therefore be 
optionally eliminated, controlled, or accepted. Accidental 
errors however, cannot be avoided and are bound to occur with 


a measurable probability. 
COMPONENTS OF RELIABILITY 


Reliability, precision, and accuracy have been defined 
in various ways. All are comparative or relative terms 
rather than absolute measures. Arbitrary scales for their 
measurement must be established based on predetermined stand- 


ards. 


Precision 

Precision is a quality of a set of data that describes 
the degree of dispersion of the values. The lower the dis- 
persion or scatter, the higher the precision. Single mea- 
Surements Cannot be considered Co, be."preci se” Yor “not 


precise," 





ACCUMAeY 

MeeCiuaGy~wseo Quality Of assim lS mmSsasuremene on 4 
series of measurements that expresses the degree to which 
the single measurement (or the average of the set of measure- 
ments) conforms to a predetermined "true" value. High 
Se@euracy implies close agreement =O Ene. predetermined seand— 


ard. 


Mearget Analogy 


The relationship between precision and accuracy is 
best explained through use of the target analogy. 

Piguses4-k alliscrates four ueuDiImGgs Of —EWwelve Gshners 
in a target. Target A illustrates a grouping which is pre- 
cise but not accurate. The shots are ina Evie. GClhasre = but 
Sensi Cette s hy ameomeVed 6 VOM Ce ene ornOr wre ne Veergoeracea, 

This is analogous to the accompanying frequency histogram of 
laboratory measurements in which the measurements are grouped 
close together but their average value is considerably removed 
from the true value of the property being measured. 

Target 8 Liluserates accuracy WiEnout ~precis tony. e1] he 
SOEs "Gluster ‘Around The -ceneer Or Ene: ta-ceu 2m a 2anconm 
fashion but are widely scattered. Likewise in the accompany- 
ing frequency histogram, measurements are relatively evenly 
distributed around the true value but are relatively widely 


dispersed. 
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PRECISION AND ACCURACY 
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Target C illustrates a dispersion of shots which is 
Meise accurate NOY ee eae ACA ene Shots wave. widely 
scattered and also do not form a uniformly dense pattern 
around the center of the target as they did in B. 

Target D illustrates good marksmanship, that is 
marksmanship that shows high precision (tight clustering) 


and high accuracy (good centering). 


standards 

ine yiolM Sedna GseOn Petroleum Produces cand siupin = 
cants provide convenient standards of precision in the form 
of Repeatability and Reproducibility amounts given with the 
Gescription of the test method. Repeatability, is defined 
by them (ASTM) as the greatest difference between two single 
and independent results by a single operator in a given 
WeaecOGeecOuy, =ae Com pe secenSi@ered aceepreolere =. ee mince, 
five per cent confidence level. Reproducibility, is defined 
as the greatest difference between a single test result 
obtained in one laboratory anda single test result obtained 


mireanOuner Laberatory thet can be considered acceprable aL 


the ninety five per cent confidence level. 





CHAPTER III 
FUNDAMENTAL STATISTICAL MEASURES 


fee OCuUCt LON 

This chapter briefly discusses the fundamental stat- 
istical measures which are applied or considered in later 
chapters. 

In the first part of the chapter the measures are 
gerined. Methods Of eStimeting population parameters from 
sample statistics are presented in the next section followed 
iby ea COMmparison > Of the relative efficiency Gf the various 
Sseimaters. Pineally a cisecussion 1S Given of some of the 
advantages and disadvantages to be considered when choosing 
each Statistic Or estimator. 

Frequent reference will be made to normal populations 
@rValSEribuLTOns Of Values. ~The theory of the normal dis— 
tribution stemmed from work done by Karl Gauss and, for this 
reason, the normal distribution is sometimes identified as 
EewGalicslem Catster burton. “Ite Mermealecur ye —1s cenined 
Meachnemeacically as 


Zz 
eC) = —+— exponential - a (3-1) 
GV 22 7ic ZG 


in which p is the mean value of the variable and o is the 
Stendard ceviation, DbOthn OF which ere desceribed in this 


ehapte r. 
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In the context of equation (3-1) f(%) is known as a 
MORObo DVI te - CSnstEy funceton."- Bor aay peoeabilicy eens ey, 
munction, £(X); the probability Ehat a value of X lies in the 


: < = 5 ° » r a << 

interval ae Es x, is given by P(X, So Ai) 
“A 

ye ee) oar (3-2) 


gal 


lA 


P(X, < Xx 


Thus, the probability that a value X lies between limits x 
and x, ieee “COC e vores Under GMS uprObd ol te censtEy 
function £(X) between the two limits. This area is shown 


in Figure 3-1. 


FREQUENCY 


RELATIVE 





FIGURE 3-1 


A PROBABILITY DENSITY FUNCTION SHOWING THE AREA EQUIVALENT 
TO Tre PROD A Beitr 2A 2 bles Belwiks xy AND Xz 


A Or te “GUS tiie On eos ce Var sao Le Sein re Se 
Signifies that the probabiilty of X being between any two 
datiniceS, a hd on) Sa VEN, by Soude lon | s=20. 6 tone tSses 

L u 


Somawiont( 3—L) ior tne wader imei on On, pet). 
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POPULATION TEAS aMETERS 


Neesures of Central fendency 

A UnLVerse or population is the totality of all? per— 
tinent observations that might be made in a given problem. 
If these observations are normally distributed, they will 
be symmetrically dispersed around an "average" or central 
value. The central tendency of the population is of funda- 
mental interest in any statistical analysis. 

The ARITHMETIC MEAN or ARITHMETIC AVERAGE, uw, of a set 
of N values, Xs, is defined as the sum of the set of values, 


divided by the number of values in the set. 





N 
EX 
POPULATION MEAN = yp = == (3-3) 


The arithmetic mean is the most commonly used measure 
of central tendency and is the value generally intended when 
the term "average" or “mean” is mentioned. 

The MEDIAN is the middle value of a set of numbers 
PaGaageq Tivasecenging "Or GSsScenaing Order aceoraing co value. 
For an even number of data points, it is the arithmetic aver- 


age of the two middie values. 


SO@24On Values 2.4 SO. sors values 


The MIDRANGE is a point halfway between the largest 


and smallest observations. It 1s computed as the average of 





13 
ene Gibest and last values of a set, ordered according to 


value 


Ay = xy | 
MIDRANGE = er Where Ay 5 A5 Se a, fe yy (3-4) 


Por <2 mormally Gistributed spopuldiion, ene ariznmet ic 


mean, median, mode and midrange have the same value. 


Measures, OF Dispersion 

The second of the two most fundamental measures in 
Seat lsStical analysis iz Gispersion., Dispersion is a measure 
Poa ee rexrene COrWh1en. Che Percinene Observacions comprising 
the population are scattered around a measure of central 
tendency. It may be viewed as a measure of precision or the 
@ensistency of ,.or the variation in, a Set of measurements. 

The RANGE is the simplest measure of general vari- 
ability. This is the difference between the highest and 
lowest value of an entire set of measurements. 


RANGE = w = Ke oe Where Xy < Xo te es Xi (3-5) 


The AVERAGE DEVIATION is the arithmetic mean of the 
absolute deviation of each value of a set of data from the 


central value, 





AVERAGE DEVIATION = A.D. = (3-6) 


The VARIANCE, or MEAN-SQUARE DEVIATION, is the aver- 


age of the squared deviations from the mean. 
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VARIANCE = 0° = a) 


From a mathematical standpoint the variance is the 
basic measure of the Gistribution, but a very Lrequentciy 
used measure of dispersion is the STANDARD DEVIATION, or 
ROOT MEAN-SQUARE DEVIATION which is the difference between 
the mean and the point of inflection of a normal curve. The 
standard deviation is defined as the positive square root of 


the variance. 


SIAN PARES DEVIAT ION SG N 57 (3-8) 


BSTIMATING POPULATION PARAMETERS 


A statistical estimation problem involves selecting, 
on the basis of sample information, an estimate which approx- 
imates the value of a population parameter. Estimators are 
used when practical considerations militate against direct 
measurement of the population parameter. © 2 he? CoOse: OF 
testing exceeds the value of the added benefits, it is 
uneconomical to measure the parameter directly. If the popu- 
lation is infinite, measurement of all samples is physically 
impossible. If the test required to measure a particular 
property alters, consumes or otherwise destroys the product, 
measurement of all sampies is not useful. These considera- 


EVOns (apply EOucesuing Of Duk Peerole um produicrscs 
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mhne problem of “determining the “best” estimator is 
varied by the circumstances of the situation. In general, 
the "best" estimator is one which has a distribution con- 
centrated near the true value of the parameter and en Gar 
be applied economically. 

AMOnG, (he Stotastical ecxteerie fOr sevetuauing osu 
ieearsS are Unblaseagness, consistency, and efficiency. 

ie: ilkedS On sean est I Mace mr ost ne Cit torenee, between she 
mean of the distribution of the estimator and the true value 
of the parameter being estimated. An unbiased estimator then 
is one which has a distribution having a mean value exactly 
equal to that of the parameter being estimated. 

An estimator is consistent if the probability that an 
Seema to \wili Var, from thie enue Velveteen the peranecter iby 
more than any given amount can be made arbitrarily small by 
increasing the number or observations in 6 sample. More 
Simply stated, an estimator is said to be consistent if the 
reliability of the estimate becomes greater as the sample 
size 1S increased. 

ine EP EMe IOC VOL an cCoeiid eon Gio. Clee iyecjceh Veer Lem 


which will be discussed ina later section. 


petuMecons Ob the Pooulacion Mean 
The sample mean, or arithmetic average, is an unbiased 
Ssciumator Obs POpUUATION mean Lor any tyre lon population. 


For a normally distributed population, the sample median and 
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the sample midrange are also unbiased estimators of the popu- 
lation mean. The purpose of the estimator is to approximate 
the value of a population parameter, however, the presence 
of extreme values in a set of sample observations (particu- 
larly a small set) could greatly distort the estimate. To 
manimize distortion, various modifications of the mean, 
median, and midrange may be computed. These modifications 
mee Variously identified in the literature but the ma jority 
molLlow two general patterns; 

a. Outlying data ina set are excluded from com- 

PutaeloOn On “eNewmea), “Medtene On Widtrence 

b. An equal number of values from the lower and 

upper ends of an ordered set are excluded from 
computation of the mean, or midrange. 

The elimination of equal numbers of values from both 
the high and low ends of the ordered set will not of course 
change the median. It should also be obvious that the median 
is a Special case of both the symmetrically modified mean and 
the symmetrically modified midrange. Given a set of six 


values, the following symmetrically modified means may be 


generated: 
(xX, + X, * X, + X2) 
So 2 3 £. 5 
Exclude Xy and Xe aX aera ie ge ae ae (3-9) 
Se el 
Exclude ax oh xX X, = .X, =--oo = Median (Seo) 


ae, eee) es 3° °4 2 


® S99 Sues Siar at 
a 


Sette 





ey 
Again using a set of six values, the following sym- 
metrically modified midranges may be generated: 
CS a) 


Exclude xy and Xe = ofa oe pe oe (3-11) 


Carnes) 
Exclude Ky, Xo, Xe, Xe 3f4 Gi ee ere Median (3-12) 


General equation for computation of symmetrically modified 
mean: 
N-A 
ae 


= a 
(A + 1)*(N - A) (NDA) 


(3-13) 
Where: A = number of values to be eliminated from each 
end of the ordered set. 


General equation for computation of symmetrically. modified 


midrange: 


(A + eta =) 2 (22a) 


Mines PE INGcipel GScvantode «Ob Arolerarily discarding "dara 
trom both Ends of en ordered set 18 the Simplicity of che 
procedure. It has the disadvantage of automatically reduc- 
Gs ENeten FeeCei Ve Size, Ob ene Sem e, Ciscarading geod cara 
aLOnge wii any “bad "data. Bor ne most ScielratiecslLyy 
accurate work, eee prefer to discard members of a 


sample set on an individual basis.” This may be limited to 
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eliminating only those values known to have been influenced 
inv Some Cat Sem POrereiy eOvr Me beStuOm me, GOte 5 ey May goo se 
accomplished by following some statistical rule by which 
values can be discarded with a predetermined error risk. The 
method of Dieore for testing extreme values, being a nonpara— 
metric test, requires only the available sample observations. 
Py xen ls Method makes USe Of Critical values jor ratios OL 
differences to be expected at various probability levels and 
for different sample sizes. If the observations in the 


sample are ranked in order of magnitude as follows: 


the ratio for testing the smallest extreme is: 


ey Seam one 
Z ] 
rt (3-15) 


ie) Sa mes 


and the ratio for testing the largest extreme is: 


Tae appropriate rable for \Vverlouc, sample “sizes 45; 


sample size 3 to 7 2 4i69 


sample size 8 to 10 : ri4 


Sample size Jil to 13 xr 


oa 


sample size 14 to 30 fe 
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Tables giving the maximum expected values for Dixon's 
ratios are widely reproduced in statistical texts. If an 
observed ratio exceeds the maximum expected ratio, the 
extreme value may be rejected with the risk of error set by 
the tabulated probability level. Another method based on 
Beat istical “probability 1s ‘the trial and error method, Taus 
method requires an independent estimate of standard devia- 
meron, A trial mean 1s computed from all the observations in 
the sample. Confidence limits at some reasonable level, say 
ninety five per cent, are then set around the trial mean. 
Bny extreme data point outside the ninety five ver cent con— 
fidence interval is assumed not to have come from the same 
population as the rest of the data and is rejected. A new 
trial mean and confidence interval are determined based on 
the remaining data. The entire original set of observations 
is tested against the new confidence limit and additional data 
points are rejected and/or previously re jected data points 
are picked up. The process is repeated until a stable set 
of values is established, that is, no additional data points 
are picked to) Or eee by the newly computed confidence 


interval. 


Boelmacors von, POOLAciG., iS er o1On 
Since the sample mean may not be identical with the 
population mean, the sum of squares of deviation of the 


individual sample values from the sample mean will be less 
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than the sum of squares of deviation of the individual sample 
values from the population mean. The variance of the sample, 
computed from the sum of squares of deviation divided by n, 
the number of items in the sample, will therefore be smaller 
than if the sum of squares has been calculated from the true 
Bepulation mean. To overcome this bias, the population 


variance is estimated from a sample by dividing the sum of 











Pala mes -OL- deviation by n> | instead ef in. 
ESTIMATED POPULATION VARIANCE 4% = s2 cea 
~ (xX. - xe | 
2. ei 2 i : ae 
oi en ts ee. | We ae (3-18) 





An unbiased estimate of the population standard devi- 
ation can be obtained by multiplying the square root of the 
estimated population variance by 4 correction factor which 


varies with the type of distribution and the sample size,’ 


ESTIMATED POPULATICN STANDARD DEVIATION = 5 = kK, V Ad (3-19) 


PonwarnOrmealy “ci Seriouced pop etone. 


Witte! ie ou dar ener 
Wipes aoe en ee he 


Ry Ce ae ia oo eee 


Ba Me, serie ee 
ne > 4 Kn a Fie mea) 
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he Scmpleeranige, ww, Muleiplied by Ene appropriare 
correction factor forms an unbiased estimator of the popula- 
PLOlmouam@Cdara Gevidgelon. Alables Giving Gosrece on Toactors 
meee applied toe the range can be found in readily availabie 
eos” and erigieoe tea and appear to be based on work 
done by Pearson.?? 

The sample average deviation, A.D., multiplied by a 
Seurecreilon f2CcCor LOLs sdnoc rer Unbpra Sed ses eitiduonw oF Jcie 
BOeCuUlAarLoOn Stendard deviation. -Still enotners and one which 
1s easier to compute than the average deviation, is the 
modified linear estimator. Tables of average deviation 
estimators and modified linear estimators were developed by 
Dixon and have been published in at least one book which he 
has co-authored.?? 

Table I summarizes the range, average deviation and 
modified linear estimators of the population standard devia- 


tion for sample sizes two through ten. 








Sample 


Size 


MOTO P=) Ove 1) = is 1G! 4) 


4 
© 


DiPeA 


DIt 


DIPe 


DIED 


(eos al 


UNBIASED ESTIMATORS OF THE POPULATION 
STANDARD DEVIATION 


(a) 


(aii) 


(ao) 


(inky 





U2 SS05K 
Oza 7 O7R 
O0.4857R 
O.4299R 
0..39460R 
O25 095R 
OU 3o bk 
0.05675 
O33249R 


where L 
and H 


where L 


wnere L 





PAs, stem 


Median 


O..6662 
Oyo 0Uc 
Ono 20 
C23S0LG 
Ue SG? 
Oy 2032 
Cr 8 
Coe 
Ce Bs es 


- 
| 


DIFsA 
DIFE 
DIFA 
DIFB 
DIFA 
PLS 
pie 
pie 
re 


Peo uA 2 


Modified 


Linear 


0.8862 
0.5908 
0.4857 
0.4299 
0.2619 
0.2370 
0.2197 
0.2068 
0.1968 


Cn 2 arom 


Peon ae yer 
(mes) 72. eo an 


IB AG 
Die 
Pe 
eee 
DIED 
Die 
Ee 
DLE e 
Die 
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Dep le len Cy OF BolliiArors 


The efficiency of an estimator is a relative criterion 
based on variance. The variance of an Senor is the mean 
squared deviation of the estimates from the true value of 
the parameter and the most efficient estimator of a given 
parameter is the one having the smallest variance. Efficiency 
[=o rineg yas Ehev rout OL Ghee Variances Or “ne @=samp ing 
Sees oet ions Cf Une Most. Srerrcr1ent estimabe land ene @seie 


mate being compared. 


Vor lanes (Ob mete. NOSc eerie Glen Ure cedmacos 
Variance of the estimator being compared 


Par eClonNew = i = 
Hence, the efficiency of the most efficient estimator is 1; 
less efficient estimators have an efficiency of less than 1. 

Relative efficiencies are approximately the ratio of 


sample sizes which will give equal precision in the estimate. ~~ 


ier Cloney ©f Population Meanststimatorse 

The sample mean is the efficient estimator of the 
DOpULAeLonm means “lhe! variance oF the Sapling Cisuaiou Lon 
of the mean is oa Prom -ehe Celanvelonm Gf err resencey «i. 
follows** Enae Vhe “Vaenilenee OL Ene  sanpling Ccuserioucion vor 
Shs Ulbteseciestiamareor Of Tae meen “or, enorme. peru erion is 
o- /nE. 


The erficiencies of the medien and midrange for 


various sample sizes are given in reference 14. The 
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efficiency of the median is high for small sample sizes 
decreasing to a value of 0.637 as n approaches infinity. For 
the midrange, the efficiency is also high for very small 
samples but decreases rapidly as the sample size increases, 
eer OaeclIngG Zere ics Mm eppreacnes Anrintey. 

By comparison of the sampling distribution of the 
means of all possible combinations of two values from a large 
sample, it can be shown mathematically that the estimator 
with the highest efficiency among the group is the arith- 
Nomen e vereage (Of snesZecot pereenut le vyelue and ene 77124 pen— 
centile yalue.*> The 25.0 percentile and the 75.0 percentile 
are usually used in practice for large samples because they 
@ectSasionr te rememoenr end! neve Onl ye sltgnhely Jlower 
erriciency. The Limiting efficiency of this modified mid-— 
wenge cOmbinacrioOn is O2S0S as mn epproaches Intinicy. For 
smaller samples, the efficiency of the Average of the Best 
Two increases above 0.808. For sample sizes larger thnan four, 
the efficiency of the Average of the Best Two as an estimator 
of the population mean is always greater than that of the 
median or unmodified midrange. The estimators and effi- 
clencies of the Average of the Best Two for various sample 
Sizes are given in reference 14. Table II gives the esti- 
mators based on the Average of the Best Two for samples of 
Size two through ten. It also compares the efficiencies of 
the median, midrance and Average of the Best Two as esti- 


mators of the population mean for these same sample sizes. 


iT a 
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TPABE ie 


fe PGE NCrRo. Oe Bol LMNIORS Of Tht rOruULAT FON (ie Aan 
COMPARED TO THE SAMPLE MEAN 


a ei ES A LT NP 


Sample Sample Sample Aver. of Best Two 
Size Median Miereance BEE. EStimetor 
2 beOOO 1, OOO ie O® 3 CK vi X-) 

3 0.743 C920 Gao 20 7 (X,, = X3) 

4. O.886 OSES ORE so) =) 74 (XK, 3 X) 

2 0.697 Coos Oseo7 *5(X, a X,) 

6 Ca Te 0.706 O2eo> *3(X. 2 Xe) 

a OLG7 2 0.654 0.849 4(X. . Xe) 

8 O43 Oz610 O283% 4(X, 3 Xe) 

2 ore Ses re 0.843 3(X2 a Xo) 
10 Ome 0.539 0.840 %%(X, + XQ) 


a ee er 


emu lLeoncy >On PObulatIOn Dispersion. BStimqgters 


The 


modified linear estimators relative to the square root of 


Zz 


S”~ have been determined and published. 


efficiencies of the range, 


Average Ceovilecien Jeng 


16 


Dye Sete lete icy 7. 


the range estimator of population standard deviation is 


relatively high for sample sizes of five or less, 


decreases to 0.85 for a sample of size ten 


Sample of size twenty. 


ieely?> lc approaches Zero. 


As the sample size 


lie Si eLerency 


and 


bie 


of an estimate 


LO~U0-70 forva 


increases indcefin- 
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ibeased Mee Che average cdevidtion 1S Greater than theac of an 
estimate based on the range for sample sizes larger than six. 
For sample size ten, it is 0.89. An estimate obtained from 
mao muedtfIcd Milineear ceviae ton inas on Sr rreieney Squall te rer 
meeater than either the estimate obtained from the range or the 
estimate obtained from the average deviation up to sample size 
five. For larger sample sizes, its efficiency is consis- 
tently greater. The efficiencies of the range, average 
deviation and modified linear estimators for sample sizes 


two through ten are given in Table IIl. 


Tasos. cue 


EPPLCIBNCIES OF BSTIMATORS OF (POPULATION 
SIAN DARD: DEVIATIONS AS COMPARED TOUS 


AR NR Ne Se i a Re re 


Sample Modified 
Sa Rance A.D. Linear 
Boel ews 
2 i OO) I-00 OO) 
5 O00 O99 Ono 
A. One G22 298 
2 Oe 0.94 Cae 
6 a98 O90 Ce 
7 sot O02 92 Oy, 
8 O.82 0290 Oe 77 
o Dray, aot C27, 
LG Oks 2 0289 O26 





oy 


CHhOGSING > TAI Sr icen Ale BoOrEeMAvOnRso 


The proper choice of which statistic or which esti- 
mator to use depends upon the problem. Again, the objective 
is the closest economically obtainable answer to the true 
value being sought. 

Quality surveillance at the command level initially 
seeks to detect conditions which may require corrective 
Sewilon. Answers which are to be used Eor management by 
exception can sacrifice some statistical efficiency for 


computational efficiency. 


Central Tendency 

The arithmetic mean is the most widely used measure 
of central tendency. Perhaps the most important reason for 
this is that means of samples of uniform size tend to have a 
normal distribution regardless of the type of distribution 
of the population from which the samples were drawn. This 
characteristic of the sample means permits the use of the 
Rorpmal ‘Cistcribution 1fn Making prcbabiiity stacemencs about 
the population mean with full confidence even if the distri- 
bution of the population is unknown or uncertain. The 
arithmetic mean, being based on all the data, Craws the maxi- 
MUM AMOUNE. OF InEOrmation Erom che samole. -At “the same 
Bitic Mle do Cireecrooroy xt emenGaca, 2 'Signitemecane Gis- 


advantage when sample size is small and the sample mean is to 





Zo 

Dew Ses. OS oh cote eare Of. Ene POD lation Mean, Suen as “ene 
€2ce 7 whenpeme “cencrak Lencgeney valuc vor 4 Correlaeion test 
sample distributed among a small number of laboratories is 

moO Oe USEC 2s ean uesctimare. Or the truce valuc4of the property 
measured. It is obviously important to exclude extraneous 
MaAlue S Ero. Ge COMpULaELON JO: Che sample mean in such cir 
SuuStances.. 

The sample median is a less efficient estimator of 
the population mean when both the median and the arithmetic 
mean are computed from the same number of observations. For 
Samete Size een. Lor Example, §6ii re ene, of “-he= median ts 
O./23. The median, however, has the advantage that it is 
MOu Seriously affected by the retention of extreme’ values on 
a sample. ~/ Pesre oe Cheney lie eit eee eno ato te ete 
therefore, is one hundred per cent since none of the opnserva- 
tions need be discarded. If, as the resuit of a test for 
outliers, three extraneous values were discarded from a set 
of ten to compute the arithmetic mean estimator of the popu- 
Paelon Mean, the Crire1ency Of Utllization of evailabre Gata 
is only seventy per cent. An approximation of tne relative 
efficiency of the arithmetic mean and the median as estima- 
BOre Tiesto Caco. Cal teen oe made. 

Overall efficiency of arithmetic mean: 0.70 (1.000) = 0.700 


Overall eiriciency- Cf Meagan: 12.00 (02/23). "S80. 723 


-_— ss 


te 
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From this it can readily be seen that the choice of the 
Sarit echmetic mean aS Ssuimactor Coes Noe Guarancec Lhe Mosc 
efficient estimate in every case. 

In the same vein, it must be remembered that although 
a more efficient estimator has a greater statistical chance 
Of being close to the true population parameter, this does 
not guarantee that for each sample a more efficient estimate 
will be closer to the parameter than a less efficient esti- 
mate. There is also the question of the relative effort or 
difficulty in finding the mean value or the median value. 
If the data are arranged in an order set the median can be 
located quickly regardless of the sample size. For small 
samples, say ten or less, the median value can usually be 
GeternimeG Sy, anspecuion Telarnivel, Quek y even 32 Ge "dace 
are not ordered. Mathematically however, the median is hard 
EO, Namacure | 

The midrange is a good measure of central tendency 
for five or less opservations but not as good as the mean. 
For sample sizes larger than five, it is the least efficient 
estimator of the population mean. Its chief merit is 
SimMpieoue OF GCaoleuleacien but, being the everage Of tne 
largest and smallest values ina set, it is even more affected 
by extreme values than the arithmetic mean and the same tests 
for extreme values are required. However, the midrange is 


Superior to the mean or median for extremely short-—tailed 
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distri@ations.-” The Average of the Best Two is a means of 
Mine ipLevealiy Creaeing @, SNOre-rell eo discerns buelon: by “chop- 
Ping off the most widely dispersed values. This estimator 
offers several advantages. Its construction is such that 
mine probability OL being -signiticantily eitected iby outliers 
is relatively small, and its efficiency relatively high 


(0.840 for sample size 10). Yet, it is relatively easy to 


eomeu ce. 


Dispersion 

The range is the simplest measure of general vari- 
Pettey and te Very -sasy eOucompuue. or tehne samele Size 2s 
small, say ten or fewer, it 1S a sensitive measure of the 


general variability of the population.*?’*° 


Since only two 
of the data points are involved in the calculation of the 
mange. le tm AWOlWay expresses Ene Vvarsaciton Of Ene cehex 
values lying between these two extremes. Therefore, the 
accuracy of the range estimate of dispersion decreases as 
sample size increases. None the less, the range is an 
Se ereme yeusen) Scteeise1e FOr Stall samples nets oLcen 
meet. In Poleite, Onto! aac Spoor iOn werk: 

The average deviation is sensitive to the variability 
©f the population regarcless of the size of tne sample since 


it is based on all the data. On the one hand, it is an 


obviously reasonable measure of variability for small samples 
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because it is simple to interpret and easy to compute. On 
the other hand, it is hard to handle in mathematical analysis 
SvonG 2O “une USe Of adosoluce values.“* Ee re Sra Leaeeney 
to use the average deviation as a measure of general vari-~ 
ability when the median is used as a measure of central 
tendency because it iS a minimum when measured from the 
median. For a normal distribution, the standard deviation 
is~/7f/2 or 1.253 times the average deviation.?” TE the 
everage Ceviation is Known Erom historical data, the standard 
a= Fle vone Or va Measurement ical pe-eStimacecdl trom tit os re la 
femsOr Sha! Di, 

The variance and the standard deviation are the most 
Prep Cleve ,On. Te veculmawore yor sPOpuUleat ion Cispersiom., (hey 
are harder to compute than the range or the average devia- 
tion but are much less affected by extreme values than the 
range and are mathematically less cumbersome than the average 


Be aeons -— 





CHACT ER iy 
ANALYSIS BY NUMERICAL METHODS 
INTRODUCTION 


The purpose of this thesis as stated in Chapter I is 
£o investigate methods of extracting more definitive infor- 
Veto COnecermiing “Ene. re labia ey Of “he jpaerr lel parang 
laboratories' test results from correlation test data. 

Some statistical methods of treating available cor- 
relation test data sets which will accomplish this purpose 
are examined in this chapter. These methods are applied to 
actual data and the results are interpreted. 

The basis of single observation testing 1s presented 
First and its limitations are pointed out. Next, a method 
of analyzing paired sets of data is described and it is 
shown that two sets of observations are the minimum required 
to estimate the consistency of a laboratory's resuits using 
a proven method. It is also shown that further analysis is 
possible but is dependent upon an adequate degree of pre- 
cision being exhibited by the two observations. 

A method of treating multiple sets of daca follows 
which is shown to produce a measure of the reliability and 
a measure of the systematic error of a iaboratory's test 


results as well as an improved measure of the relative 
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accuracy of results. Two laboratory rating methods are 
described which could be used as supplements to the Summary 
Se baboratory FPerlormance described in Cnepter I.> One metnod 
Provides an. index Of accuracy and’ an. index cf precision £or 
Specific tests. ? Ine totner provides a. laborarcory) ranking 
macex fOr ‘the Tamily Of EeSts associated wiltn-a given produce, 

The manner of presentation of each of the methods for 
Beal yczing che correlecion tese Cata 1s) to, discuss the theory 
and then describe the procedure. The procedural descrip- 
mon tmcludes ililustrecive computations using ectuel <or— 
Belaulon test date Obtained From a ma jor militery commena. 

Terminology used in connection with the reliability 
of laboratory test results is defined in Chapter il. 

Sane statistical measures applied are those discussed 
Pinewopeer iia, Analysis ox Jeaporateny cese BeSUlLTS 2S: noe 
only a problem of statistical estimation but also a problem 
of hypothesis testing. The statistical tests applied in this 
chapter have not themselves been discussed previously in 
this thesis except for tests of extreme values, but they use 
the same statistics discussed in Chapter III. The tests will 


be described-as they are introduced into the problem. 





TESTING SINGLE OBSERVATIONS 
Discussion 


Accuracy limits. “A Minimum OF ECwo séts of observa— 
tions are required to establish an estimate of the precision 
of a test method. These can be repeated tests by the same 
operator using the same equipment to establish the operator- 
equipment precision (repeatability) of the test method, or 
paired duplicates from separate laboratories to establish 
the interlaboratory precision (reproducibility) of the 
method. Once establisned, the repeatability amount and 
meDrOCUCIDLLICY amount Can <~be Used" EO ‘check etme accuracy ‘or 
a single observation when the true value of the property 
being measured is known or can be estimated. 

eG Represent: Ere Mean ~cliteorcice ee veon Dears son 


test measurements. 


n 
It can be shown that the mean difference between 
: —- aoe : : : aaah ao 
pairs, dad, is (2/V Tm) times the standard deviation. By 


Erensposing terms, 4m expression 1S opteined- for compucing 


the standard deviation of a single measurement. 


. OVT 


9 : = 0.88624 (4-2) 


A confidence interval to the true value of the prop- 
erty being measured can then be established around the 
Singie Observation, xis 


ij 


oa 


Confidence range for p = X ZG (4-3) 
Assuming that the single measurement, ae GOMeS+ ir one 
normally distributed population of similar measurements 
ee@recred by a large number. of small srandom factors. 2 is. che 
normal deviate appropriate to the desired confidence level. 
Mine term, = 26, is the “tolerance sec on the precision oF 
measurement X. Therefore, if G is known or can be determined, 


the accuracy of a single measurement can be estimated cor- 


responding to a predetermined degree of confidence. 


Peete ee sm les iP Or a6 aa “ 20 
=. = z2(0.8862d) (224) 
ne Value Ob ZHat. UneS river per neConn Drobaatiliry sever 
ane, die) Ole 
Accuracy limits for Xp 9. 7 » le C6 Cone ao2) co 
0 = oe 4 


These limits can also be expressed as a ninety five 
cei} 


per cent accuracy confidence interval for a single observa- 


tee Os. OMe oats 


1) 
i oe GZ) Se ee i) (4-5) 
Cin eee ae Sea ee) 
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This interval can then be used to test the hypothesis 
that the single observation oe ioe ee oe hed seer aiotaas 


the true value, up, of the property being measured. 
EEOoceoure 


Dota ene aosumpetOucn, | Mnet ray "ata sCOua Casa =o Ere 
test results for a given property eee ESO sa Single 
sample which has been divided and distributed among the 
participating laboratories. Analysis of the data is based 
upon the following assumptions: (A) The sub-divided samples 
are homogeneous, that is, there is no quality variation of 
mne! mMaceraal caSertouted. cto. Ene varvous parerecipating lab 
oratories, (B) The universe of observations for each labora- 
eeormy end ail Paboratovies is norma lif distribuced, (C) Fre 
test procedure has been vroven, tnat is, it is adequately 
described to preclude general eee ere ae of the exact 
procedures to be followed. 

For example, the following single measurements were 
SubiIveeee (eS) Ee Fl Geavitey -O.. o7latlen GasOlias Genie 
63-1700 by the ten participating laboratories in a cor- 


relation test. 





Aeon ee 


MEASUREMENTS OF API GRAVITY OF AVIATION GASOLINE 
SAMPLE 63-1700 BY TEN LABORATORIES 


REED lS A a NN yr a A Ra I NT A NO ee me er a ta nn NE em 


bab. 4 2 3 A 5 6 : 8 9 We. 
Test 


ee omar 2 tOo oe) OFT OF Me Cet 769 tl (Go l2 69.2, 60.2 Co 7607 


— re mm ee ee ee 
I I TOP, PO CE CE LE IE tN GG LN IE a A OE LE 2 LAE ON TE PRLS NL. PT TE LEA I IE AE NR AE tT En i eR TN i FF 


Dees Ont Glee. aCCuraGy. | COMPU ver ence ecSeltatce eLuc 


API gravity of the gasoline using the sample arithmetic 





mean as the estimator. Substituting in (3-3): 
LL 10 Coe. 


The ASTM reproducibility amount, R.,A., described in 
Sie DES yi) eeGan be SUubS2i tuted for he Hineey five. per cence 
Sones denes sin eervalerandes Say in 6A Gaet eis sonear dco 
test the statistical accuracy of the single test result 


obtained by each laboratory. (4-5) then becomes: 








i _ R.A. 


: [A + RA. | 


\ 
i< a = | (4-6) 


Oo 
and the decision rule is: 

If the observed value is between the estimated 
population mean minus one half of the ASTM Reproduc-— 
ibility amount anda the estimated population mean plus 
one halt the ASTM Reproducipility amount, concluce 

-cnen Lesules Goraetnec by tre Jtabcrecory ©6r ca1s tec. 


are statistically accurate. If the observed value 
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lies outside these limits, conclude tnat results 
obtained by the laboratory for this test have 
errors attributable to assignable causes with a 
PPvyewoek weeme tebe « Jom, soot nce wreetic,, 

BPeccrmine une Asi’ Kepreoduciollicy amounc. R24... Erem. ehe 
peancaerd Mechod orClest Eor AP Grevity oe Perroteum 
Products, ASTM Designation: Doe oee 


Rene = Usomececcnueco Sire 


Compute the ninety five per cent confidence limits: 





AR r 
ee ses See cos 


At the ninety five per cent confidence level, test 
me fyeOoucmesis chat. tne APL Gravicy measuremenc oN he Den Ger 
Phe AOR cCOry 7 ClonctAatloetColD ye tne Same =o ele Ismet 
Sev iil enewGanpl le. UbSeCe cing aim Cee) 


60052 = 6G75> 


J 


i ene — MSG ibeeween 6S. 0> "and O69 >>. aeCoece ene 
hypothesis and conclude that results obtained for this test 
iy Ga DOraLO-Y Fearne Sueatitetlealiy eecurace. FE “ehe = is 
less than 69.05 or more than 69.55 reject the hypothesis 
and conclude that results obtained for this test by lab- 
S@pauOry 9 have errors ettribucable “wocassigonable causes, 


ne Mypechesis, 45 retlec2 co for  Cworvasues. 


x 7 oo ee OES 


x. Pe Oe Oa 





ic Ci ceOrelNG Career. OL. OUGl Vind, cate On esuimares 
O£f population parameters was discussed in Chapter III and a 
eplalimeand error meLhoo or eCliminating outliers from, compu— 


EetcilOne OL “he mean was described. Applying this method: 


ime) 
> xX. - X, - X 
A it 6 = 
a Do ee 
Us, 5 3 60 


Compute new ninety five per cent confidence limits: 


As R.A, 


Oe eee = eee 


Substitute in equation (4-6) and retest the hypothesis for 


all ten measurements ae 
65.35. = < 69.45 
The hypothesis is rejected for the same two values: 


Ay ee ee Oo 4 


I 


Xx 


2 C276 ol.) 


SInecermo-a0diutoOnal GCete Sorines Were. -reVeocted 406 Wole 
previously re jected were picked up, a stable set or values 
has been determined. 

This is the method presently used to evaluate cor- 
relation test results. It has been previously pointec out 
RNe@ewealSs MeEcnOG GIVES NO. {ao teet Ion (Or Woeries Sy seeuseie 


Srrors or mistekes ere che causes OF OVC—Or —cCOoOnLroO!l oserve— 


ales. 





TESTING PAIRED OSSERVATIONS 


Piseuss Lon 

Just as a minimum of two sets of observations were 
required to establish an estimate of the precision of a test 
method, two ooservations are the minimum data required to 
estimate the consistency of a laboratory's results using a 


proven method. 


Precision Limits 
Two, ObSet Vat lOns Gan  ec-anedly2e0. for. prceltelon by 
estimating che standard deviation from the mean difference 
between pairs. Precision limits for u: 
uw = X = Zo (4—3) 
CONELASCHCe BME rval sor 1X: 
Ci 2G) ee (4-5) 


bee the seonr i Gence Taence, = 20) When Ger cerseans Lox 


@a given vorobability level, be represented by the symbol ZC. 





The paired test resuits from one iaporatory are represented 
BY wes Jono Mews “PRE Sample meen. Gs, 2S Biesestimatorvor tence 
Aj B j 
population mean. Then: 
Cot Oe) ee Ce ee a7) 
3 To Oar 3 
SUS CUCU rer As 
J 
= “ny >= —_ 
| ° Fe ay oe . 3 xX ‘ 
fai “Bi! eax. ee ee (4-8) 
5 = —— ce Z " ma - 4 _ 
Za Sas Ds 


[- 
ee 
coo 

a 
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Ciearm@ag, £reacticons: 


a me fp + 
(Xa Xp Zee = 2) S (Xp Xp Ze) 
PRO wLace rnd <a ws 
u G g Xp. 
(Xp =e S (Xp 3) S (Xp Zo) 


Suortracting ae 


mr ansoosing: 


(Xp - Xe) < = 2C (4-9) 
Bnererore: 
Phe en eee OP. eet Ge eee 4-10 
A 34 = s 
Likewise: 
Vase ety Ge kee ern eee ee ee) 
j pe 
SuUOSELEULINg cor Xs, Gua ect eo eG Ons, yolos soon lie ao eng 
Xp y: eee temo Desi. (ce sme: 
at ae ee ee 
BJ ye 
And: 
} 
3 3 - oe ooge 
ic 
t t ! | 
eo ene ee 
pie A 5 ie Biy 
tThererore: 
ee) ee eRe ae (4-11) 
{ 





at the Thinety five per cent conrticence level: 
as us 
ee A ee potas Se coe. 
“Aj ~ “BS as 


Js 
NO 
won 
te 
6 
1 
us 


eee . hohe : ca g oe ane > 4 Tage “ 
Timer awce Jie. ceo teen) 2S seo Eo CooL 
af war @ waist Or CHhServVver_ cus { %7 a V awa 
— uae ena aoe an ee Noor AS oe ee ed Lwin oe a Bi? Cl Ss 
ay 2 


one and tre same value. &f they are, further si 


inferences may be drawn from them. 


t+ 


ps —~ = * we oe ee Pan stale - ke 7 Suse 
StlLMeting SvVSGeNe llc .r7or. L tne cryo 


Fa 
ry 
O 
3 
Q) 
- 


£2 


obpservations 


me ae lteHCant. Oe Macs Ct: Tame. about fr a. -O=-SCliOon OL 


- 


a 
LJ 
Ud 








to 
t 
ee 
I! 
Be 
Ee hit? ve 
Ud 
| 
ba 


ae 
pals 
- 


mMeEncS wWnich are not statistically precise, there is 


eeagmeapi lity Enet eieaes “On \ DOrn of Che measurenents 


4 


SC 2ecer yy. SOW "eS eCeo Obese Cocleo OF DreciSicl. an 


Ssystenati 


(4-12) 


(4-13) 


-ithouch constant ractors may be present in measure- 


@ high 


also 


Con vain errors caused by mistakes of unknown magnituce and 


@irection. A "biaes' computation would be meaningless in such 
Girecunstaences, could only cause cormrusion and shoula not be 


mece, 


¢ 
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Boccuracy limits, The stancard deviation of tne means 
of samples of size n is estimated by dividing the estimated 
mepulation Standard deviation by the square root of n. One 
possible way to define accuracy in a normally distributed 


maou lLation. 1s: 





ee ae (4-14) 
where : 
u = the true value of the property being measured 
G = the Popular en seance.s cevlelcion 
n = the sample size 
Zz = the normal deviate for the desired level of 
Contr icence 
oe is eae 
a 


BUt- Once eCcarn. he Sr O>oer GnoOVce Of B@ SEeEISGeic OF 
f = ¢ aS ae 
estimatoz is dependent upon the available data and the 


mieeciced. DUurpOSse TOY whitch le 2S eo he used. FOr smaid sampies 


O 


meceOlance Of tne Ay oOEnesis thet tne Samp. .e mean end Tae: Erue 
value of the property being measured are statistically one 

Boe ene Same value @4 Ene, basis Of Ghee vebove ese ay occu 
even wnen the situation is not true. For examole, assume two 
observations are obtained of a proferty whose true value is 
memo  tieoCaW Der reaoly Geer Ta. £egebhaless Of macnn eucs, 

if the two observations have the same value but opposite signs 


the average will be zero. Although the mean of the 





oa 
PS 


Boseryvecions would be statiscically “accurate” tre hypoth-= 
esis test is obviously meaningless for such a situation. 
One simple way to handle this dilemma is to tie the 
PeeCULAGy OCCeE MI peclOn He Ere, SreCcrsven ceSses ie ayooch— 
esis test for accuracy of a laboratory's test results would 
then be modified to the extent that X would be redefined as 
the average of a set of laboratory test results which are 


statistically precise at some specified level. 
Procedure 


Data. The raw data required are the test results 
for a given property obtained from two samples which have 
been divided and distributed among the m participating lab- 
Seacories. tc iS not necessary that both samples be of the 
Semle PrOOUGc. LE may oe Teast bee GO pool Test results of 
Sue Lerene products. VOikK: Staces that, in compar ne paired 
data, the pairs do not nave to be measures of the same 
thing, but the individual measurements in a pair will be made 
at the same eonditions.~- THe GOjSCelLve: 2S GO [Sayo1d 2-2 o— 
Sueing acecleione!l sources of Veriebility. Generally, chis 


raha 


objective can be accomplishea i= the test procedures are 


identical and if the samples are reasonably ciose in the 
ae 5 ee oe | | : ae 
magnicuae OL Cne* proser ty, being evelvated, rove Ver,.even 


though pooled test results are ontained from statistically 


homogeneous samples, if they are not duplicate tests of the 





a 


same sample, they do not have a common mean. Consequently, 
the observations ie and oe Canmael be Comeered Ga rectly. 


The algebraic deviation from the mean, Vag HOD Coach, Oaseu= 


vation must be determined by subtracting the mean, Ke, 
computed for each test from eacn observation revorted for 
mat test. 
oe = ae - X, (4-15) 
MOper Wit De Sas Geoue Emo “SOG! Inc. .Os ca ca. co Form 
waroccr Samples in the section on multiple test results. 
Correlation test results of the ten per cent distil- 
maelOn pOint Of Lwo Olirerent samples of aviation gasoline, 


grade 115/145 provided the data which will be used to 


illustrate the procedure. These two sets of values are 


given in Table V. The results labeled as Test 1 are measure- 
ments taken on correlation test sample 64-27. Those labeled 
as Test 2 are measurements taken on correlation test sample 
64-3599. The corresponding matrix of observations, v.., is 


7 
~ 


even. in -lTebie Vi. 
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LE=V9 
G* O+ VS 
G*O+ oe 
Oe. Vv 
S*O- O20) 
G* T+ eee 
So 9° 9 
0. C= Se 
G*O+ 9°0 
oo O= 9°O 
G* C+ oC 


reer eee 8 ee rem ee 


LNAO Wad OT 


x a 


SR 8 5 em ee 6 i Ce Le ee Se ee - S48 ee, —— 
te en RN NN A ae A rl a et : 





e—~Tdmes 22 ASsey, 

oTdwueg eT Ase, 
GE HACE CH OT 
Gr CHONG. hs 6 
Gre ire see 8 
SG ec LO ie 
Cr le 28> a 9 
S°V- 8° T+ G 
Cee "Odin v 
C O+ 8° O04 € 
CEE ae Os C 
rosea I 

a | Sons [f *gqey 
T 4501 
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Ho © GCHATHORU LNA 


t “¢ 

CG 4504, 
€ _~ = 
°T FSoL 
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COO: a eee One 
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Se ry ee wae eer om 


[ 
AJOL IOS | 


wae Ol Siro 


qOVdD aN OS) NOLIVIAY sO NOldy ri bbs id 


hn aay 
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MosvinOcCLOue. Mile tyc le "On ehe dace 1S based 5en ene 
following assumptions: (A) The sub-divided samples are homo- 
memcOus, eat, Loe schere VS Nol Cualtcy variaci on .On, Line 
Macerial distributed to the various participating lLapnora= 
tories for each test, (B) The universe of observations for 
Perel ictivity and all-activities as normally distributced- 
and, (C) The test procedure has been proven, that is, it is 
adequately described to preclude general misinterpretation 
of the exact procedures to be followed. 

BeecisiO’ Rules precisions he ASI’ reprocuei pili ty 
amount, R.A., described in Cnapter II, can be substituted 
for the ninety five per cent confidence interval range 2C 
may {4-il, eS a standard to test the statistical precision of 
the pair of test results obtained by each laboratory. (4-11) 
then becomes: 


Cee oe ee (=e. 


— ts 


aan the cecision rule is; 

Lf the absolute value of the difference between 
the deviation from the test means of two indevendent 
measurements is equal to or less than the ASTM repro- 
GUC MPI Vey “Amour FOr the Test. cenecitige £net Fresuite 
obtained by the laboratory for this test are suf- 
ficiently precise, i.e., errors affecting results are 
probably due to chance causes inherent to the pre- 


Servcec test. mMertnod, Lf the absolute difference is 





Oveate “chetethe-ASlMareprocuctbi lity amount, .con= 
elude that results optained from performance or tnis 
test by the laboratory have errors attributable to 
assignable causes with a five per cent risk of being 
ite ORG 
metermine the ASTM reproducibility amount, R.A., xrom the 
Standard Method of Test for Distillation of Petroleum Pro- 


euccrs, ASTM Desicnation: Deemer 


Be ee oe 


mor the m laboratories, compute: 


is 
| 
< 
ll 
ae) 
C 
O 
3 


At the ninety five per cent conficence ievel, test 


5 


Se ase © 


Os 


eae Nhyopornesis chat. che cen per cence In O@ sore 


To 


it 


4 


aNGteuery j-are. Sroce 


}-4 


« 


Measurements Xs ail CaaGn 
+) 2) 


focicCally the Seme. in respecte co Ltheiy deviacion .comn tne 


ae Ome e Cugo.. 


Bruc’yValYes OL Une ten: per cenit Gisci tation poincs* of 


5 


samples 1 and 2 respectively. Substituting in (4-16): 
’ ' 
ees 
V,."7~ V ef ~ 
, 13 cae 
| 3 


i 


Bo ere OnSsollile Cie re rence be oween and is 


1 5 OF 
Scgual uo Or Lless.tnen. 7, accept ihe nypochesis and Jconciuds 
Cac .eSiles Cp aA Neo 2Or Edis Gesu by Labora tee, | ere sur 
Eicienely precise. 1f tne dirtterénce is ¢creacer than 7, 
Be jecu “ine Nypoecnesis ana concluce that-cesultis optaines 


for this test by laboratory j fail to meet minimum standards 


for precision. 
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- 
ie = 25) 


results are tabulated in Table VI. The paired test results 


The differences, V for the illustrative test 
maom all laboratories are precise according to the estab- 


ished Standard. Consequently, all may be further analyzed 


men everage bias and for accuracy. 


Biles MWeasurewene. “Ne mMeanqdevietion- from the. mean 
meeenic Dalred test results reported by laboratory j 1s deter= 
fomed by: 

me fee ee 

te eee | (4-17) 
Myr1s is equivalent to (4-12) for the bias estimate based on 
mae Observations Erom a laboratory which shows an acceptable 
degree of precision. The values v computed from the illus- 


Peacive data appsar in Table VI. Tnese values will be furthez 


Meili zedcin testing the accurecy of the Laboratories. 


DEC On-giu.6 2 wecelsdeye Artese POr geese y. te 
given by (4-14) in which X is defined as the averace of a 
eee OL laboratory test results which are statistically pre-= 


feeoo at “SOMme Specified level. “Sudseicucing = Tor Mean 


J 


v.. Lor p, (4-14) becomes: 


J ~~ 2) won 


Buc Vis is zero by definition. Therefore (4-18) becomes: 


v <0 - 22. (4-19) 





Ut 
©) 


The ASTM reproducibility amount, R.A., can be sub- 
stituted for the ninety five per cent confidence interval 
mange — so im!(4=19) asla standard. to test the statistical 
mecuracy OF che palred test results obtained byveach labor= 


atory. Also substituting for n, (4-19) becomes: 


R.A _ RA 
_ FO Li = 
2Y 2) = GS BAZ (4-20) 


and the decision rule is: 

Tf two single observations obtained from statis-—- 
tically homogeneous sources are statistically precise 
at the ninety five per cent level, and if the absolute 
value of the average variation from the mean of the 
paired single observations is within the ninety five 
per cent conEi1 dence rance based on the aepolicable 
AS IM -Reproducibilicy amour’, Concluce nat cesults 
obtained by the laboratory for this test are accurate. 
Pte ehsoluce Vvellectom eevee ego: 25.5 0o,6 ven Delo 
Terie y 24 Ve. Pen teem Coie Cone ance, Gone oes 
that the results obtained performing this test con- 
Lom eu eOre wep secon ne. De ineCor st tee Tor, Gna 1ce 
Gauses With a Live oer Cenc risk Of having seached 


wee tweOrG GONGanSs LO... 


(cir 
" 
os 


COMDUCe ENS B28iety Give er Gans Cont toence Lams 


oe ee) re 


<7 0 fF ee = 2 2 
2 








At the ninety five per cent confidence level, test 
the hypothesis that in regard to deviation from the true vaLkLue 
of the property measured, the average of a pair of measure- 
ments 1S Statistically. ene same aS zero, Substituting in 
(4-20): 


esto ee Tt eae Ge 


ie eae me iS between 225 sene 92.5 sccene the nyoor n= 
pees end Cone jude wnat, om che everege, results obtained for 
mais test by laboratory j} are sufficiently accurate. If the 
v iio lees hemo 245 on Greate, tran 9225,. vosect ene hyootn— 
esis and conclude that, on the average, the results obtained 
fer this test by laboratory } nave errors attributable to 
assignable causes. 

Dp yeetio-sis, sSeeceepeeo £6, Une tom teperacerics 
mothe Gxeample but laboratory lais on.the borderline. 

While these resu.ts produce a quick and satisfactory 
Meacication OL accuracy, cney do not make full use of the 
available information. They co not take into consiceration 
the probability of statistically indevendent events. The 
outcome of either of two separate laboratory tests is not 
eenditioned by the outcome of the other. ipexeeore ooserva- 
Poi enwdiar Ouse VetiGn eB rere Srattsc teal iy i ioSpelcene and 
mien PeOoaDILLey yon (bose Avan. Bs OocecurriIng is" Tne Drocuci: sor 
Eneworobabiiity of A occurring end therprobability of &B 


Securring . 





1 
NO 


Pak end: Bes ee eter 2) (a2) 


The hypothesis test employed assumes that both obpser- 


vations (either the X,, and Ge revlicate measurements or the 
a) 
5 and V3 single measurements) come from the same normally 


distributed population. Therefore the distance from tne 
Bepuletiton meen Ob Gach Coservacion can be expressed in 

m= OMS Of Mr I Doles “Or the populacion Scanderc deviation, 

mae 85 see ynioOuma | OeVtace,) 256 fae area UNGer Che moeoucnc, 
eaStribution- curve, bounced by the interval cz which includes 
Z,, measures ENG PerObd Oi pVEy Oe OOLalni ng GbServeciones 19 

@e random Semple as Shown in Figure 4-1.  Likewice, the area 
Bice e scene min eeOUeNey CLG Cr OU EVvOuuecUryo! WBOUNCSO! Syueme —17 co. — 
val cZ which includes 2. measures the probability of obtain-— 


* ” ares ic e aS a = a — —~ «6 ea; 9 apy e 
ime Ose Uver tens fea semcon  Satpuoe. (Nt io enone els ce tou on 


—e 4« 
> 
a 








Pree. sees . 
S . 
7 a t t 
| 
6.2 ae C2, 
UL 
TAS PROPABLUALY OF <Q INC A - CILVEN VALUE 
FROM A NORM. CISTRIBUTION 





ne oe Gm Set ey On Occ ott nits .6 DPavelCU lar Value lOreZ cin 
Maisie s oS 2. 83Cheaces. BRSrelLorc, ENS prove o. ity on 


Soreinuncg Ewo Observac:Ons Out in “One “Or cne oOtner Cail of 


mane Gistribution due to Cnance causes alone 


Benversely. Ene Pprobebilicy of 


close to the population mean if only chance causes 


affecting the measurements is reiatively high. 


a 


8S 


1S Very small. 


ODES 2 4G WO ObsSe. vari ons 


Given two sets of results, (Zn a ene oy Zn = 0.0) 
and (Z.5 = he oe Zao = 1.96) one would conclude intuitively 
miec Lesulcs from laboratory J are more apt tc be accure 4 
mga, VeSULes route 0O-alLOry 24. Sniceco Pc. can. be (shown 
Met: Jr. ]@ Bon ice Zoi ce vel sor, OC .02 15 Sense CUucea 1Or CZ, 
tne probability or obteining the subset of measurements 
(Zns, Zn) que to rancom variation is more than six and a 
Wee “ERO SueSs Cred as — 70 Deo oao. ity eh. Oote 22st ene SsuD 
set (2,5, 225)- 

Pi. wee: ee Smee ee ee ale. te ee 

eS ee glia eC eee eee Os a 

Pr (Al and BL) = (.0012){.0080) = 6.$0(107°) 

Be ee We i ee eo, ee Oe 

Ba, Pe eno = Or = ZCI 

Pr (A2 and B2) = (.0012){.0012) = 1.44(107°) 

Ne “Consecuences 61 apely2n0g tiest tule {Go Not. a Doce: 
to be significant enough to tustify che considerable extra 


effort required. However 


weal -v sy 
Einee Foe rT a 


effect should be noted. 





U1 
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Ve We ortmisOm ERS Seat Olt e OL COnt Ccemce evel, Gas sreo— 
SOs ak Of Sa @lSerya lon &.Creetes nan 25 05 and an 
observation B greater than Zo 05 oe OeeOOL Ose Sinem. Gore an. 
Therefore the Gecision rule carries a risk which varies 
meme e es] iO O20072 198 Ween el escityi Ae an, Veeccurtere 
Beales AeCcumote, <Onverzcely, oS tes OF Faaling 


MOOG eCt. an * IMIsCeUraere Celt vity. 3S: i ereasec.. 


by 
O 
Wd 
0) 
ts 
3 
a 
I] 

4 
(> 
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Re StN Gamat ee jo 


prSeuss ton 

OOS eee eee eS ee GUO Ben ye 
Oratories as represented by the matrix of Table VIL. Assume 
Bee Pee oO INSeT So -Ob. Coser Vac ONS 2er CoCn bose (26 Ose 2 
eisSeripucec. Tne Objective 2S LO cetermine £re Kind ana 


7 © FY © » 


Maco uOS On Verte Oulu 1s een oS ese e lec, =o vein UGed 


In ObServaticns Mace by a civen 1 C2er 7 ee Bee ime 
asure GPO otee yy Oe, eee eee Aa Wee oie te oe 
Sees Be Conver ee ore {Cer mMeooUre lente OL, vest o elon 2 
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lim S is positive, the laboratory meets the minimum 
standard established for accuracy. The larger the.value of 
is. the higher the degree of accuracy. If oe is nega- 
eave, che laboratory does not meet the minimum stancara. The 
ierger the negative value 1s, the more inaccurate ere the 
mesults obtained by the Laboratory. 
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ine Be ate is positive, the laboratory meets the minimum 
pieanoanorestablisned Lor preexzcivon, Tne tarcer une value oF 
mee a the higher the degree of precision. If pater is 
negative, the laboratory does not meet the minimum standard. 


The larger the negative value is, the less precise are the 


meoults obtained by the laboratory. 
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ANALYSIS BY A GRAPHICAL METHOD 


A gGraphicel method for evaluating new laboratory test 
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FIGURE 5-1 


DEVIATION FROM A HORIZONTAL OR VERTICAL AXIS 
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FIGURE 5-2 


OUADRANTS FORME D BY, THE. INTERSECTION 
OF TWO PERPENDICULAR AXES 
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FIGURE 5-3 


Tie -bOCUSSOn WB XPSCTED VALUES FOR ALL OBSERVATIONS a B ) 
AFFECTED ONLY BY SYSTEMATIC ERRORS 


The locus is a straight line through the intersections 
o£ the A and B axes bisecting quadrants I and IIil. 

As the next step, recognition is given to the existence 
of chance causes of variation which will cause deviations from 
the locus just described. Excluding the possibility of mis- 
takes, a data point (A,B) is now expected to fall not on the 
forty-five degree line through the intersection of the axes 
bUtIw ie hin-ron area “surroundiImGra-CGiven CoOlmwe On sane anes 
The maximum amount by which a pair of observations can be 
expected to vary a stated percentage of the time solely due 
BO eC lance “causes cam be determined and eae circle of -statisci— 
cal confidence can be constructed around each point on the 


line. 
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Bene CONSiStent recurrence Of scattered paired data 
BPelMirsr within such a“circle centerea on Ene Intersection 
of the two axes would indicate highly reliable performance. 
The observations would be considered accurate because they 
are clustered around the true values of A and B. They are 
acceptably precise because they vary only within the limits 
of the established performance standard. The consistent 
recurrence of paired data points within such a circle of 
confidence centered far out on the forty-five degree line 
MOuMlaumAacCate wane accCeDealle vaeqTeSe OF PreciSilon.bur Poor 
accuracy. The accuracy is considered poor because the 
paired observations are centered on a point far removed from 
the true values of A and B (Figure 5-4). 

SINGS Stine. EOrey-Ltivye eonece line wise the locussor an 
infinite number ea oneatiey EACNCITGlesi1Om eOnt =Gence around 
them become a confidence band bounded by parallel lines on 
each Ssice Of Che EOrty-Live decree line ae a perpendicular 
eiecancenecual jEOtEnS actus et Geile vol nelenor “cont icence 
(Figure 5-5). 

As a final consideration, assume the existence of 
a large group of laboratories, each having only one opera- 
tor and one set of equipment. Also assume once again the 
existence of only random errors so that all data points will 
cluster about the intersection of the true value axes. Two 


variances can then be determined. The repeatability 
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FIGURE 5-6 


PLOT VOF “PAaLRED CORRELATION TES) MEASURES OF VAPOR 
PRESSURE OF TWO SAMPLES OF COMBAT AUTOMOTIVE GASOLINE 
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Scttmee Conridéence Limits 

Determine the radius of the ninety five per cent con- 
fidence circle, XQ 957 bys SUbestiLaeiGn Vint eouarlons (556) “Or 
(5-9). If equation (5-9) is to be used, determine the 
Reproducibility amount from the applicable ASTM Standard 
Method of Test. 

The R.A. will be used as the basis for computing XO 95 
for this example. From the Standard Method of Test for 
Petroleum Products, ASTM Designation: Doze s5e) pac hee 


for automotive gasoline in the 5 to 16 pound vapor pressure 


mange: see Se. SUSSELEUeIng 1a (5-94 
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On Se 
Construct the ninety five per cent confidence circle 
for accuracy around the intersection of the median lines A 


Sich We tne, che Lacie ia With parallel ‘rulers. con- 


CPE be 
Struct a forty-five degree line (line passing through the 
ice r sects On) Ol the median lines and bisecting quadrants I 
and III) and ninety five per cent precision confidence limits 
parallel to the forty-five degree line and tangent to the 


MInety five per cena -Girele for accuracy. -~ Filcere 5-s 


illustrates the completed graphical construction. 
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Plotted eee can be interpreted from either of two 
viewpoints. The general distribution of data points is of 
interest in determining the likelihood of sampling errors. 
mine Ocatilon Of individualidata points 1s the basis for lab-— 
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cemeral Diseri OULE Oneon Date Pores 

if yehe only €rrers aerecting tne deta were random 
PEmOLSL, Or rec iouon, pOsobe veranda negsaeive Curors would be 
relatively small and would occur with equal likelihood. As 
SPreslle Gata DPOIneS Ghoula be jex pected to berrigntly, 
scattered, more or less equally, in all four quadrants formed 
Dy the intersection of the two median lines. This is the 
1deal situation, and is unlikely to occur. Individual lab- 
S@rauOlyroiases Will normally cause Waporatories to Gprain 
results on the true samples which are either both negative 
@r Och positive in relation. Ce vche’ medians A concentration 
of data points in Quadrant I and Quadrant III can therefore 
pe expecreed.) Ine More pronounced Cais tendency Lo ineividud! 
mies, the greater the-departure will be from. tre®i1dealveir— 
Sulake Glstriou lon, 

in the event that the paired observations are nearly 
ecucuiily  divlded among Ene Lour  quadranks (the. possibilary oF 
iva lia Gata vesuliing Lrom.e sample distribution error Should 


be considered. 
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Solymeme culate ena Most probably armisvaxe Nas “entered ante 
one of the measurements of the pair (the measurement of 
sample B). In the case of laboratories 4 and 9, the loca- 
MiG "Ol ehe deta ~pei1nes. Gor oS ONE te a mistake entering 
into one of the measurements, chance causes normal to the 
method (one out of twenty measurements will fall outside the 
ninety five per cent confidence limits in the long run), or 
We@r Precision “due LO modifications of the Test method or 
due to carelessness. None of these possible causes can be 


considered most probable without additional data. 


ALTERNATE PLOTTING METHODS 

Additional analysis of relative performance can be 
made by comparison of multiple sets of paired observations 
From each laboratory. These observations can be combined 
and displayed in various ways. Consider, for example, a 
subset of four observations, (A,B,C,D) representing the 
results of the same test on four different samples by the 
same laboratory. The alphabetical sequence indicates the 
Grronovecical feguence Eat Ee Peerct were pecrormed.- {Ene 
time interval between tests is one month or more. There are 
two logical ways in wnicn this set of observations can be 
formed into subsets of paired data. The first way is to 
combine the observations in chronological pairs, without 
Guplication, to form the subset (AB,CD). The other way is 


be COUlbine. ENE Observations I sCNPOdelegical peirs, wien 





duol@@at#on, to form the suvoset (AB,BC,CD). The ’eter alter- 
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SUMMARY AND CONCLUSIONS 


summary 


In this thesis, the author has investigated some 
Becerstleat Meas Of CbLaAInNING more dezImitive inrorma wien 
Bence ning ene- reliability Of military petroleum. teseing 
Laboratories than is currently obtained from existing cor- 
relation testing programs. Numerical methods of analyzing 
Single observations, paired observations and multiple 
observations, and a graphical method of analysis were dis- 
Gussee pe Proceaures were described Eereanalvyging and inter— 
preting the data by each method and were applied to actual 
Miteearyrcorrelatwon tese daca, 

Table XVI summarizes the tests which can be applied 
we. eceGm oclhy1 ty. 

le was £ound that EoOr alsingle epservation: one could 
Best tne (nyeeenesSsis au any pre—-selecreqccontidence Jtevel 
that the single observation is statistically the same as the 
true value of the property being measured. Since there is 
no dispersion to a single measurement it cannot be tested 
wOx BDGeCLS VON.) “UAeTe  O-e NO EUrtner anus ical von. can ce 
mace Of ~a,Cecision-ehae 4d Single observation. is Statistically 
inaccurate at the selected confidence level. This method, 


uSing a ninety five per cent confidence level represented 





ia 
LAaSBEE, AVE 
SUMMARY. Of Tists CP DABORATORY MEASUREMENTS 


Tests based on the ASTM Reproducibility amount (R.A.) 
provide confidence at the ninety five per cent level. 
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Mescts Of Single Observecions 


Hypothesis test for accuracy: 

















‘A Eee, regen. 
gs ioe ccs eae a Nea a 
MeSeS) Or what rea Observations 
Hypothesis test for precision: 
a —JRLA- (4-16) 


Hypothesis test for accuracy (if precision hypothesis 


is accepted): 


Ree _ mek 
— ee OV, ST (4-20) 
Seo ae ee 


Estimate of bias (if precision hypothesis is accepted): 


itt ees, 
v; = = (4-17) 


fiestas (Or Muiee Lolo Opser Val Lous 


Precision index: 


La 


Minimum Staendard 332 : 
P.tI.. = So (4-34) 


(exes 


(4-33) 











aS 
TABLE XVI (continued) 


ReEGUracy  1meex : 


Manimum Stencdard for | v | 


as ie ee ee ee OL Es. 
j AlCl 150 (4-32) 
J 
een] Ga Ger erg Pala aiee, 
J Jf J 
; A hi -Rser a RA 
Minimum Standard for | v5] = (Aaa) 
| J 2 Yn 


Estimate of bias: 


Bias estimate = ve excluding extreme values 


Laboratory Ranking Index 


eer wy ee a (4-35) 
5 ees cS 
i 
ee Lhe we lgnCing PacroreLOn tes. 1. aeterminea by 
Ehee Le leei ve “Silent cance Of" that’ tes: tor tne 
Operarctona! Dperiormancevon cae product. 
/\ 
AG ace 
Z., 7 (4-36) 
14 A 


Graphical Analysis 
ReG rie FOr Gir Cle 101, (GO tence se rEcecluracy ar 
precision: 


To.05 7 O67 ore) (5-9) 
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iy 
op yoeiowoe lhe erocuetoi iy ANOuUnt;. «1S Ene Current merned OL 
evalWenring correlation vese -data, 

When two homogeneous sets of Single observations were 
pooled and analyzed as pairs of data, the hypothesis that 
the two single observations of each pair came from the same 
PODULAtLIOn could be tested at the ninety five per ‘cent. con- 
fidence level, thereby measuring the relative precision of 
the two observations. If the precision hypothesis was 
accepted, the hypothesis that the average of the two paired 
observations came from the same population as the estimated 
‘\ . 

u could be tested to determine the accuracy of the measure- 
Meneses.) Mao lit. ONSene JPL lor Cone ClLOn enol ene. precession 
hypothesis was accepted, the average bias error of the two 
observations could be determined as an indication of a 
systematic error due to assignable causes. 

Wnen several homogeneous Sets Of single onpservacions 
were pooled and analyzed as a group, it was found that 
precision, accuracy and bias could be measured independently, 
Pet mrS pete VellLOLecy Ob) Ge. ceSe Of - One cua ltuy on whe 
MeacuLe Vem es Viad7tO CS penNcenece jon Lhe peaor “ou Lcone “oT 
2G Rt wea BeCivs OM ian ssceuracy wore ec mMeasuved fy -an 
Sesiiy -ineerpreced wncex Ccomeuted by Comparison Oo escab-= 
his ioe Mint ais Seaonmdaras. Wine Si16enveid tne MecmieuGe "or ihe 
index indicates the reiative goodness or poorness of accuracy 


OL ECC iS1On. 





A graphical method of analysis was developed which 
BOOM ewe se@ ly one: Simple muLeiplicetion: calcusetion Lor its 
iets ce ier ea elo lana iO mete ta wice le -ecaleu leat tonSsiinete— 
eer ne Gata ererenelyZec Jnl pol re requiring. .c) mani muim 
Simeewe scoestoOc ol ng le PObSerVarlonis.. Wien the fnetyois 25 
limited to two single observations it was found that the 
same limitations were encountered in interpreting the results 
when the pair of observations were not adequately precise 
as were encountered with tne numerical analysis of paired 
observations. Increasing the number of sets of single 
observations included in the analysis permitted more speci- 
Pee sa or Pree on.. Whetister Zines are pnicalk. mou nioe<Or 
Poel ySis) Ene wnOneogencley of ‘eta Sees Goule be veriticd py 
observing the general pattern formed by the plotted data. 
A separate statistical test or homogeneity of variance was 
required wnen using tne numerical method. 

Analysis by the graphical method was used to illus- 
Biece now che pooling of “homogeneous cest- daca sets 
Pheacreased ene “GrlLeceivenecses Ore enalysis OF PCOrece tation est 
me cUPeoraS sce Manecenel. LOO. OL Ene (Mel arery Commanee:. 
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in the final ana 


fn 


Vie Sop. Sete eee 8s ONO Ch eo ey 
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Le \COSCS SOc esl ee SC So cer tiSs 


rh 


in performance of sveci 


Sber I ECOLreCelLVeCieSSic Yigg 1o SLOduCe 4s 26: SUleeplitcy 2or 


sea 


Wee. init RoC. Our tar ne’ TeoOracori1 eS acCOrcinge co. enews 
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ee 
assogmeced with a single product was therefore ceveioped as 
a usertul) imenovement onthe Summary on Laboracory Peri ormence . 
Phe mMeenod, provides £Or. Lhe Gomputarion ef a Laboratory 
Ramen Gost naere Wie it Gye. -2OmpOsi CSO, "ner Co uenLve Cecuracy 
@ 
of measurement of the various properties of the product, 


ned ne eas-i. weCerearce Wile “anew e Signer teance 1 segard: co 


me @oehes 1Onal oer iOoumonece "Oe. tne prosuce. 


Senclusions 

The current method of analyzing correlation test data 
1S stetistically too primicive to provide the military com-— 
mencer with aedequate antelitcoence concerning the eliective= 
ness of the petroleum testing laboratories within his area 
On JUL LSaLecrion. 

Matnceimine co hich -cCecuce 400 Scclinac, lemons £ne petrc!— 
eum testing laboratories is the specific goal of a military 
@Ocreol Agt@iw vesting DeCorem. \ Bue eeGcureGy Soa fuUnce  On.ox 
DESC On enc. bles. By emelyzing (me eceutee, on “2 1a >5ere— 
mer S~WOLrk vin, LemmS Or Drecisten “adi cias “ne Worvelarion 
BeStInG tog an jee OS Mace? InLO © WOrTe Se  focrive: Manecenen l= 
by-exception tool. This requires, aS a minimum, analysis of 
paired homogeneous data so"i= or, preferabiy, anaiysis of 
malcaple nomegqenecous Gace =srs. 

Further investigation of the requirements of an 


SrlLeccive peor relation cesting rogram 1S Scroncly 2 econmended. 





E28 
This thesis was limited to investigation of some statistical 
methods of evaluating the reliability of results of labora- 
tory tests of petroleum products and better methods of 
evaluation were found. Many other facets remain to be 
explored before a complete program can be formulated and 
recommended for implementation. Evaluations of optimum 
meecmency OF CESts;, Cvaluagclony er -tney Signitricance of- each 
MeSe, INVeStroation OL Che validity of using the ASTM 
BPePpeoolCcil Dili y  amowie “aS. a Standare., anc. anveseldas1 On oD 
the relationship between correlation test measurements 


and routine test observations are a few. 
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